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Historically, the element of uncertainty in quantum mechanics has been expressed through math¬ 
ematical identities called uncertainty relations, a great many of which continue to be discovered. 
These relations use diverse measures to quantify uncertainty (and joint uncertainty). In this pa¬ 
per we use operational information-theoretic principles to identify the common essence of all such 
measures, thereby defining measure-independent notions of uncertainty and joint uncertainty. We 
find that most existing entropic uncertainty relations use measures of joint uncertainty that yield 
themselves to a small class of operational interpretations. Our notion relaxes this restriction, reveal¬ 
ing previously unexplored joint uncertainty measures. To illustrate the utility of our formalism, we 
derive an uncertainty relation based on one such new measure. We also use our formalism to gain 
insight into the conditions under which measure-independent uncertainty relations can be found. 


INTRODUCTION 

Revealing one of the most striking features of quan¬ 
tum mechanics, Heisenberg [T] showed that the out¬ 
comes of certain pairs of measurements on a quantum 
system can never be predicted simultaneously with cer¬ 
tainty—regardless of how the system is prepared. Heisen¬ 
berg’s original statement of what he called the “indeter¬ 
minacy” principle concerned potential measurements of 
the position and the momentum of a quantum particle. 
Many later works [2H6] lent quantitative rigor to Heisen¬ 
berg’s original idea and generalized it, both in the num¬ 
ber and type of measurements involved and in the mea¬ 
sures used to quantify joint uncertainty. At the same 
time, Heisenberg himself set off another chain of research 
on a related concept: measurement-induced disturbance 
and so-called noise-disturbance relations IMO]. 

Pioneered by Hirschman [4], many works |llH19j have 
used entropies to quantify uncertainty, culminating in 
a recent surge of quantum information-theoretic treat¬ 
ments of the uncertainty principle |20H31) . An impor¬ 
tant contribution of these recent works is the formulation 
of uncertainty relations applicable on a quantum system 
correlated with a quantum memory; such relations are 
used to strengthen the security proofs of cryptographic 
tasks [32l [33] . These are in addition to existing applica¬ 
tions of uncertainty relations in quantum cryptography 
[5il - l5S] . the study of quantum nonlocality [371131], and 
continuous-variable quantum information processing |3i- 

sg. 

In all of these areas, the primary ingredient is the con¬ 
cept of the uncertainty of a variable, as well as that of 
the joint uncertainty of several variables. The aim of this 
paper is to clarify these concepts from an information- 
theoretic perspective. In the literature, the uncertainty 
of a variable has almost always been discussed in terms of 
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measures that quantify “the amount of uncertainty”, e.g. 
the Shannon entropy and its extended family of Renyi 
entropies, geometric norm-based measures such as the 
quadratic variance, etc. In most cases, there is a clear 
operational meaning for such measures, rendering them 
well-suited to the particular application wherein they are 
used. Similarly, measures of the joint uncertainty of more 
than one variable have been constructed either by consid¬ 
ering operational tasks that involve all the variables, or 
by combining single-variable uncertainty measures math¬ 
ematically. In the present work we extract the com¬ 
mon thread beneath the operational descriptions of all 
such (single or joint) uncertainty measures, resulting in 
some basic operational axioms that are independent of 
the measure used to quantify uncertainty, and that de¬ 
fine the essence of our concept of uncertainty. 

These axioms are motivated by information-theoretic 
principles that are intended to be as objective as pos¬ 
sible. Gonsidering the challenges inherent in such a re¬ 
quirement, we restrict the generality of our treatment 
in the following ways. Firstly, we restrict to notions 
of uncertainty applied to classical random variables. In 
particular, this class of variables includes the classical 
outcomes of quantum-mechanical measurements. Sec¬ 
ondly, we avoid measures of uncertainty that explicitly 
involve the values of a variable, and instead consider only 
such measures that depend on the variable’s probability 
distribution. This necessitates a restriction to discrete 
variables; in fact, we consider only finite-dimensional 
variables. We make some tentative suggestions for the 
treatment of discrete and continuous infinite-dimensional 
cases, but leave the actual extension for future work. Fi¬ 
nally, in comparing the uncertainties of different variables 
(which a measure of uncertainty should naturally be ex¬ 
pected to enable), we will require the compared variables 
to represent the same type of physical quantity. For ex¬ 
ample, a comparison between the uncertainties in two 
different length variables will be possible within our for¬ 
malism, but not one between a length uncertainty and a 
mass uncertainty. 

The crux of this paper are the following axioms: (1) 
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One’s knowledge about a variable cannot increase un¬ 
der any processing without addition of new information 
about the variable; (2) The uncertainty in a variable 
representing a physical observable is invariant under the 
symmetries of the observable; and (3) The joint uncer¬ 
tainty of several variables is a valid concept even with¬ 
out an underlying operational description that combines 
those variables. 

The first two axioms are inspired by earlier approaches 
[24H27j to measure-independent notions of uncertainty, 
wherein the connection between uncertainty and a math¬ 
ematical concept called majorization [43] was utilized. 
Majorization is a hierarchy among probability distribu¬ 
tions, induced by the action of a class of transformations 
called doubly stochastic maps. In this paper, by finding 
a mathematical characterization of mechanisms that can 
increase a variable’s uncertainty, we gain an operational 
understanding of why, and to what extent, majorization 
plays a role in characterizing uncertainty. 

First, we find that for variables with unrestricted sym¬ 
metries, uncertainty-increasing mechanisms are associ¬ 
ated with the set of all doubly stochastic maps, leading to 
the emergence of majorization as the relation determin¬ 
ing uncertainty. A function that quantifies uncertainty 
must then possess the property of never decreasing un¬ 
der any doubly stochastic maps. On the other hand, with 
restricted symmetries, only certain sub-classes of doubly 
stochastic matrices feature. The resulting hierarchy is 
then different from majorization, and a measure of un¬ 
certainty is required to be non-decreasing only under the 
restricted classes of doubly stochastic maps. This opens 
up more options for functions that can serve as uncer¬ 
tainty measures for variables with restricted symmetries. 

Another element of novelty in our work lies in the third 
of our axioms, concerning joint uncertainty. In the con¬ 
text of physics, we can rephrase this axiom in terms of ex¬ 
periments: Suppose that we are interested in quantifying 
the joint uncertainty of several experiments, e.g. in con¬ 
nection with the quantum uncertainty principle, where 
the several experiments are different quantum measure¬ 
ments. One approach would be to construct new exper¬ 
iments that combine the original experiments in some 
way. For example, consider the following combined ex¬ 
periments constructed from a given set of experiments: 
(a) all the original experiments are performed indepen¬ 
dently; (b) all the apparatuses are set up, but only one 
of the experiments is chosen at random and performed. 

The uncertainty in the outcome of such a combined 
experiment would quantify the joint uncertainty of the 
constituent experiments. But we see that there are dif¬ 
ferent ways to combine experiments, which all capture 
different aspects of the joint uncertainty. In this paper 
we argue that the richness of joint uncertainty is not cap¬ 
tured even by considering all such combined experiments. 
The most general notion of joint uncertainty is devoid 
of the particulars of such combinations, and allows all 
the component experiments to be, in principle, counter- 
factual. To illustrate this, we consider an extensively- 


studied type of quantum uncertainty relations: the so- 
called preparational uncertainty relations. For ease of 
explanation, let’s consider a two-measurement prepara¬ 
tional uncertainty relation, which has the generic form 

J{p{p),(l{p))>c, (1) 

where is, a, measure of the joint uncertainty of two 
variables, and p(p) and q(p) are the expected outcome 
probability distributions of a pair of measurements per¬ 
formed on a quantum state represented by the density 
operator p (our arguments can be extended to more than 
two measurements). We show that most existing prepa¬ 
rational uncertainty relations can be subjected to one of 
the specific operational interpretations (a) and (b) men¬ 
tioned above. To show that these two interpretations are 
unnecessarily restrictive, we construct joint uncertainty 
measures that cannot be interpreted either way. We go 
on to derive an uncertainty relation based on one such 
measure, which is a relation nontrivially different from all 
the ones discovered in the past. The main purpose of de¬ 
riving this new relation is to demonstrate the possibilities 
opened up by our joint uncertainty axiom. 

Another contribution of this paper is a deeper un¬ 
derstanding of so-called universal uncertainty relations 
found in [24U27] : pairs of vectors (u, v) such that J{u, v) 
provides a nontrivial bound [like the c of Eq. 0 ] for a 
whole class of measures, S J. We find that no univer¬ 
sal relations exist if J includes all possible measures; how¬ 
ever, restricting to specific operational frameworks [using 
the (1) and (2) types of combined experiments discussed 
in the previous paragraph] is what makes the nontrivial 
universal relations found in [241 - 127] possible. 

Even though we focus on preparational uncertainty re¬ 
lations in quantum mechanics, in principle our notions 
can be applied to any situation where probability-based 
uncertainty measures of classical variables are relevant. 
We summarize the possible applications in the conclu¬ 
sion, along with open problems. 


I. WHAT IS UNCERTAINTY? 

We will now develop a notion of uncertainty that can 
be applied to finite-dimensional classical variables. In 
particular, we seek a general method of comparing the 
uncertainties of two variables, in such a way that the 
comparison gives the same verdict independent of the 
function used to measure uncertainty. For our purpose, 
it will be sufficient to be able to compare physically simi¬ 
lar variables, that is, variables representing the same un¬ 
derlying physical quantity; for example, comparing the 
uncertainty of a length with that of another length. We 
will not concern ourselves with how a comparison can be 
made between dissimilar variables. 

Consider an experiment where Alice is about to roll a 
(possibly biased) die, whose faces she calls “1”, “2”..., 
“6”. The eventual outcome of the roll will be a value 
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X G {1,2..., 6}, but since we don’t know x a priori, we 
represent it as a random variable X = {{x,px)}- What 
is Alice’s minimum uncertainty about X prior to the ex¬ 
periment? We could answer this question in different 
ways, some of which might appeal to the particular la¬ 
bels that Alice uses to call her outcome. For example, 
the difference between the largest and smallest possible 
outcomes that have a non-zero probability is an uncer¬ 
tainty indicator, and it depends on the choice of labels. In 
principle, Alice could relabel her die’s faces to, say, “a”, 
“b”, etc., without changing the essential physical nature 
of the experiment. We will require our notion of un¬ 
certainty to make no distinction between two physically 
identical experiments that differ only in the outcome la¬ 
bels. In other words, we will consider uncertainty to be a 
property of just the distribution px , measured possibly 
by some real-valued function U{px)- In fact, an even 
stronger restriction follows. Let K be a random variable 
obtained by merely relabeling the different values of X. 
The probability distribution py of Y must then necessar¬ 
ily contain the same values as px , possibly differing only 
in their order. Therefore, the effect of any relabeling on 
Px is as though the original labels were just permuted 
amongst themselves: px M*^’"’^px. In this sense, per¬ 
mutations, although not the only possible way to relabel 
outcomes, still capture the effect of arbitrary relabelings, 
as far as our notion of uncertainty is concerned. 

If, instead of a die-roll outcome, X were a physical 
property, e.g. the energy of a quantum harmonic oscil¬ 
lator, arbitrary permutations could result in loss of the 
variable’s physical meaning. To avoid this, we would have 
to restrict the permutations, e.g. to only shifts in the en¬ 
ergy. In general, the restricted class of reorderings is the 
group G of symmetries of the observable underlying A, 
with each symmetry g corresponding to a change in one’s 
reference frame. For finite-dimensional observables, G is 
a subgroup of the group of all permutations. 

Our first requirement from a measure U of uncertainty 
is that it be invariant under the symmetry group G of the 
underlying observable. This immediately leads to the fol¬ 
lowing: Two variables X and Y, both representing the 
same observable, are equally uncertain if their distribu¬ 
tion vectors are related by some g G G. If A is the 
outcome of a certain experiment, the random variables 
Y that are equally uncertain to A include relabeled (un¬ 
der G) versions of A (which are perfectly correlated with 
A); the outcomes of other runs of the same experiment 
with the same apparatus (which may be correlated with 
A if the apparatus has a memory); outcomes of the same 
experiment performed on independent but identical ap¬ 
paratuses (uncorrelated with A); and in general any Y 
representing the same observable, with py = M^^'^px for 
some g G G. 

Thus far, we have found a way to tell when the uncer¬ 
tainties of two variables are equal. Now we will develop 
a method of determining when and how the uncertainty 
of one variable can be said to be more, or less, than that 
of another. To this end, we will first identify certainty- 



FIG. 1. Bob tries to estimate his uncertainty about Alice’s 
variable X after it has been corrupted by channel T into Y. 


nonincreasing transformations: processes that take any 
given variable A to an equally- or more-uncertain one. A, 
by virtue of a “randomizing” or “forgetting” mechanism. 
Thereafter, we will use the following rule to compare the 
uncertainties of two variables A and Y (arbitrary but 
with the same underlying physical observable): Y is at 
least as uncertain as A if some uncertainty-increasing 
transformation of A results in a variable A that has the 
same probability distribution as Y up to the symmetries 
of the underlying observable. 

In order to identify the certainty-nonincreasing trans¬ 
formations, we will now construct a couple of extended 
versions of the “Alice rolls a die” thought experiment. 
First, consider the modified experiment depicted in Fig.[2 
|44) : After rolling her die, Alice sends the outcome x to 
her collaborator Bob (who doesn’t even know the bias 
distribution of Alice’s die) via some classical channel [45] 
given by the column-stochastic matrix T = (T^i^,). Here 
let’s pause to reflect upon the uncertainty in the output 
Y of the channel. The channel could transmit x perfectly, 
or with some added noise. In these cases the output Y is 
equally or more uncertain than A. On the other hand, 
the channel could also completely ignore x and output 
some constant value, in which case the uncertainty of Y 
could be less than that of A. In fact, the processing 
might result in information in a fundamentally differ¬ 
ent form from A. For example, Alice could just send 
the parity of her die outcome to Bob, in which case Y 
doesn’t even represent the same underlying observable as 
A. Therefore, we cannot make a general statement about 
non-increase of certainty under an arbitrary channel. 

However, instead of the uncertainty of Y itself, we can 
consider the following question: How much information 
does Y contain about x7 Since Y results from processing 
A with the possible addition of noise or irrelevant infor¬ 
mation, it cannot tell us more about x than A does. In 
order to lend mathematical rigor to this statement, we 
must extract from Y some variable that has the same 
physical meaning as A, so that we can treat them both 
on an equal footing. Now let’s return to Alice and Bob’s 
experiment: Bob, who knows T but not px, now tries 
to recover x from the channel output y, which is a pri- 
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ori distributed according to qy = Tpx- Since this game 
is being designed to analyze uncertainty about x, Bob’s 
aim in his recovery task is not to maximize his chances 
of guessing x correctly, but rather to faithfully account 
for the uncertainty that Y contains about x. Suppose he 
sees an instance Y = y. This could have resulted from 
a particular X = x' with conditional probability Ty^^i. 
Without knowing the prior px, Bob’s rational guess for 
the likelihood that X = x (among all the possible x') is 
given by 




T, 


y\x 




y\x' 


( 2 ) 


The resulting distribution of Bob’s recovered variable 
(call it X) is given by the composite action of T and 
R on px: 


pj = RTpx=.D^^-px. (3) 

Since Y could contain irrelevant information, its uncer¬ 
tainty cannot be interpreted as “uncertainty about x”. 
On the other hand, X directly represents x, while X re¬ 
sults from extracting out of Y precisely all the informa¬ 
tion it contains about x. Therefore, these two variables 
both represent the same physical observable as x, and 
their uncertainties directly quantify uncertainty about x. 
This equal physical footing also ensures that their uncer¬ 
tainties can be compared under our rules. This compari¬ 
son tells us that the uncertainty of X cannot be less than 
that of X. The cumulative transformation that takes 
X to X is therefore a certainty-nonincreasing transfor¬ 
mation. It can be verified easily that for any column- 
stochastic T, with the corresponding R [3^ constructed 
as in ([^, the matrix = RT is doubly stochastic. 
The (necessarily degenerative) evolution of the informa¬ 
tion about some entity (like x), when the representation 
of this information is subjected to any classical process¬ 
ing (represented by the action of the channel), is always 
via such matrices, whose collection we call I?”®®. 

We saw that, after the action of a generic channel, the 
uncertainty of the final variable Y doesn’t have a consis¬ 
tent hierarchical relationship with that of the initial vari¬ 
able X. In order to draw a consistent rule of certainty 
non-increase, we had to consider a recovery transforma¬ 
tion from Y to X. But there are certain special trans¬ 
formations that always result in certainty non-increase, 
even without the addition of a recovery transformation. 
In fact, we already saw an example: symmetry trans¬ 
formations of the underlying physical observable. In the 
die-roll example, symmetry transformations include non¬ 
identity permutations, which can easily be shown to be 
outside of the die’s I?®®® class, yet result in final vari¬ 
ables Y with the same physical meaning and (consis¬ 
tently) no less uncertain than X. We will find a family 
of such certainty-nonincreasing transformations by con¬ 
sidering another thought experiment, depicted in Fig. 
Before rolling her die, Alice will toss a coin; she will then 



FIG. 2. Extended die-roll experiment: Alice relabels her die 
based on a coin toss, then rolls the relabeled die. The outcome 
of this experiment is more uncertain than a simple die roll. 


relabel the die’s faces with a permutation that is deter¬ 
mined by the outcome of this coin toss, and then roll the 
die. The random choice of relabeling makes the outcome 
Y of this modified experiment more uncertain than X. 
In general, if a variable X is transformed by applying a 
g G G chosen at random under a distribution t = (tg), 
the resulting variable Y is distributed as = D^^’^px, 
where = '^g^QtgM^^K Since each is a per¬ 
mutation, every possible is doubly stochastic. We 

denote by 1?®^™ the set of all such T)®^™ matrices. If 
the observable’s symmetry group G includes all permu¬ 
tations, then by Birkhoff’s theorem [371 [35] 1?®^™ is the 
set of all doubly stochastic matrices, but a restricted G 
results in a corresponding shrinkage of 

The characterization of the classes I?''®® and 1?®^™ is an 
interesting problem that we leave for future work. While 
the latter class depends on the symmetry group of the ob¬ 
servable, the former depends only on the dimensionality. 
For a variable with complete permutation symmetry, as 
noted above, 1?®^®® contains all doubly stochastic matri¬ 
ces, in particular all of I?®®®. But under restricted sym¬ 
metries, each class can contain members not belonging 
to the other. For instance, take a 3-dimensional variable 
whose symmetry group is the (order-3) group of cyclic 
permutations of the components. The two nontrivial per¬ 
mutations are transformations contained (by design) in 
psym, T)®®®. On the other hand, the matrix 


/I 0 
0 0.5 
\0 0.5 


0 

0.5 

0.5 


is in I?®®®, but not in T)®y®®. Therefore, the structure of 
the union of these classes cannot be reduced to either one 
of the classes. This example can be generalized naturally 
to higher dimensions. 

Due to our restriction to uncertainty comparison be¬ 
tween physically-similar variables, the “sym” and “rec” 
classes of doubly stochastic matrices together suffice as 
mechanisms of uncertainty increase. In principle, any 
function U{px) meant to measure the uncertainty of X 
is required to increase under both these matrix classes. 
But the “sym” class is more important that the “rec”: 
the former is based on the natural symmetries of an ob- 
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servable, and therefore the constraints that it induces on 
uncertainty measures are inviolable. On the other hand, 
“rec”, even though it is an essential ingredient in the 
strictest information-theoretic definition of uncertainty, 
could be ignored in natural situations where information¬ 
processing is not involved. Functions that respect the 
“sym” constraints, but violate the “rec” ones, neverthe¬ 
less turn out to be useful indicators of uncertainty. Based 
on these considerations, we dehne: 

Definition 1. A measure of uncertainty of a variable X 
is a function U of the distribution p = px of the variable, 
satisfying 

U{Dp) > U{p) VD G (4) 

U{Dp) > U{p) VD G (5) 

Here the class “sym” is determined by the symmetries of 
the variable’s underlying physical observable. A function 
that satisfies but not will be considered a weak 
measure of uncertainty. 

If the symmetry group G of a finite-dimensional X 
contains all permutations, then functions that satisfy 0 
are called Schur-concave functions [13] . Examples of such 
functions are the entropies of Shannon, Renyi, and Tsal- 
lis. Now, Hardy et al. [49] proved that the existence of 
a doubly stochastic D such that q = Dp is equivalent 
to the binary relation p q, read “p majorizes q” [33] . 
which for a general d-dimensional vector space is defined 
as follows. Define p'^ and q'l as the same vectors with 
their components arranged in nonincreasing order. Then, 
p q if, and only if, 

k k 

VfcG {l,2...,d}. (6) 

i=l 

The “completely certain” and “completely uncertain” 
distributions e = (1; 0 ...; 0) and u = (1/d; 1/d ...; 1/d) 
satisfy e >- p u, Vp. 

If a variable has restricted symmetries, then the un¬ 
certainty hierarchy of its distributions becomes different 
from the majorization hierarchy. All Schur-concave func¬ 
tions still remain valid uncertainty measures. But in 
addition, by virtue of the reduction in the class 
some non-Schur-concave functions could also qualify to 
be weak measures of uncertainty [i.e., may violate condi¬ 
tion ©]• For example, for a finite-dimensional variable 
X whose symmetries are cyclic permutations, it can be 
easily shown that the variance of X is only a weak uncer¬ 
tainty measure. Generalizing the classes and 

for discrete-infinite and continuous variables may not be 
straightforward, and we leave it for future work. We ex¬ 
pect it to be possible to achieve such a generalization 
by considering parametrized families of symmetries (e.g. 
Lorenz transformations) and convex combinations (inte¬ 
grals) over different parameter assignments. 


II. JOINT UNCERTAINTY 

The uncertainty of the outcomes of individual experi¬ 
ments cannot provide a complete description of the quan¬ 
tum uncertainty principle, since most uncertainty rela¬ 
tions are lower bounds on measures of the joint uncer¬ 
tainty of the outcomes of at least two measurements. 
For clarity of discussion, here we will restrict to pairs of 
experiments, each with a finite number of possible out¬ 
comes; extension to more experiments is straightforward. 
To motivate our definition of joint uncertainty, consider 
the following hypothetical scenarios involving the joint 
uncertainty of a coin-toss outcome, X, and a die-roll out¬ 
come, Y: 

Example 1: Perform the combined experiment compris¬ 
ing an independent and simultaneous performance of both 
the original experiments [Fig. (a)]. The outcome is 
Z = {X,Y), which has |Ar||y| = 12 possible values, dis- 
trubuted as pz = Px G Py- Therefore, U{px G Py), for 
U any single-variable uncertainty measure (in the sense of 
Def. 1), serves as a joint uncertainty measure of X and 
Y. Most measures considered in the literature on the 
quantum uncertainty principle, e.g. the sum of Shannon 
entropies of the individual outcome distributions, can be 
interpreted through such a combined experiment. 
Example 2: This time we first toss a second coin to make 
a choice between the actions “toss the coin” (resulting in 
outcome X) and “roll the die” (leading to Y), and then 
perform only the chosen action [Fig.|^(b)]. The outcome 
Z of this experiment has |A| -|- |Y| = 8 possible values, 
whose uncertainty (modulo the uncertainty in the choice 
of action) is also a manifestation of the joint uncertainty 
of (A, y). In this case, if the choice coin is unbiased, 
Pz = {^Px ® ^Py) and therefore we get measures of 
the form U (^Px ® |py)- The measures of joint uncer¬ 
tainty proposed in [37] can be interpreted through such 
a combined experiment. 

As these scenarios illustrate, there could be differ¬ 
ent ways in which experiments could be combined into 
one super-experiment, the uncertainty of whose outcomes 
then reflects an aspect of the joint uncertainty of (A, Y). 
But the essence of joint uncertainty is not quite cap¬ 
tured by any one of these joint experiments. In fact, 
some joint uncertainty measures, such as the functions 
Ha{px) + Hgipy) (where Ha and Hp are Renyi en¬ 
tropies) [T3], and even Heisenberg’s AxAp, cannot be 
interpreted as the uncertainty of any single combined 
experiment. The quantum uncertainty principle applies 
also to cases with several potential measurements, each 
a potential (actual or counterfactual) experiment in its 
own right. 

These considerations indicate that the notion of joint 
uncertainty is not bound to the concept of combined ex¬ 
periments. What, then, are the essential properties of a 
measure of joint uncertainty? Firstly, the pairs (A, A) 
that have the least joint uncertainty are ones where both 
distributions are completely certain. The most jointly- 
uncertain pairs, on the other hand, are the ones where 
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FIG. 3. Two possible operational combinations of a coin-toss experiment and a die-roll experiment: (a) Both experiments are 
performed simultaneously and independently; (b) Only one of the two experiments is performed, based on a random choice. 


both variables are completely uncertain. Furthermore, 
all the measures of the joint uncertainty of {X,Y) are 
real-valued functions of the distributions p = px and 
q = qy, and must reduce to the measures of single¬ 
variable uncertainty (as in Def. 1) if one of the vectors 
p and q is kept fixed. This brings us to the following 
definition: 

Definition 2 . A measure of joint uncertainty of two 
variables X and Y is a real-valued function J of (p, q) = 
(pxjqv), such that 

J (Dip, Daq) > J (p, q) (7) 

for all doubly stochastic matrices Di, D 2 in the respective 
"sym” and “rec” classes of both variables. As in the 
single-variable case, we will call functions satisfying & 
for the “sym” class, but not for the “rec” class, weak 
measures of joint uncertainty. 

It can be verified that this definition applies to entropic 
joint uncertainty measures of the form /(p)+g(p), where 
/ and g are single-variable uncertainty measures. The 
vast majority of the literature on entropic uncertainty 
relations uses such measures. Note that if the symmetry 
groups of both variables are the respective full permu¬ 
tation groups, then Di and D 2 can be any two doubly 
stochastic matrices of appropriate dimensions. In this 
case, the relation in Q states that ff is monotonic un¬ 
der the direct product relation defined by: 

(pi,qi) > (P2,q2) (Pi P2 and qi ^ q2). 

III. THE QUANTUM-MECHANICAL 
UNCERTAINTY PRINCIPLE 

The “uncertainty principle” of quantum mechanics is 
actually a collection of identities known as uncertainty re¬ 
lations (UR’s), all concerning the uncertainties of individ¬ 
ual quantum-mechanical measurements, as well as joint 
uncertainties of sets of two or more (actual or counter- 
factual) measurements. Broadly, there are three different 
operational contexts of UR’s: different measurements ap¬ 
plied on the same quantum state (either counterfactually 
or by preparing many copies of the same state); simulta¬ 
neous (approximate) execution of several measurements; 


and sequential execution of several measurements. The 
notions that we developed in the last two sections can be 
applied in all of these contexts, since they all include in¬ 
stances of finite-dimensional classical variables. But here 
we will focus on the first type of situation, where different 
measurements are considered on identical preparations. 
Furthermore, we restrict to UR’s that involve only the 
probability distributions of measurement outcomes, and 
not the “values” assigned to the outcomes. 

Since these UR’s involve only the probabilities of out¬ 
comes, a positive-operator-valued measure (POVM) de¬ 
scription of measurements is adequate in the formal¬ 
ism. Consider the case of two POVM’s A = {Haja 
and B = {Fb}b. For a quantum state p, measurement 
A leads to outcome probability distribution p{p) where 
Pa{p) = Trpap], and B to q(p) with q^ip) = Tr[Ft,p]. 
For a so-called incompatible pair of POVM’s {A, B), there 
is no p that results in both p(p) and q(p) completely cer¬ 
tain, leading to the existence of a “minimal joint uncer¬ 
tainty”. Many UR’s are statements to this effect: 

^(p(p),q(p)) > c Vp, (8) 

where ff is a measure of joint uncertainty, and 0 < 
c < Cj {A,B) minpj' (p(p),q(p)). In some relations 
(e.g. Robertson’s), c is not a constant but rather a non¬ 
negative function of p. The disadvatage of such a lower 
bound is that it can be zero in some cases even if A and 
B are incompatible. For this reason, state-independent 
c’s are favored in most of the recent literature. 

In general, our analysis of uncertainty and joint uncer¬ 
tainty enables us to unify the understanding of all UR’s 
of the form 

J{Piip),P 2 {p)---,Pn{p))>c, (9) 

where is a (strong or weak) joint uncertainty measure 
(under a generalized version of Def. 2) of the n probability 
distributions (pi..., p„) that result from measurements 
(Ml ... ,An) (counterfactually) applied to the same state 
p. A vast number of UR’s reported in the literature, 
including most entropic UR’s, take this form. In fact, 
most of the entropic UR’s found so far fall under a much 
stronger restriction. As we mentioned in the previous 
section, they can all be constructed upon specific no- 
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tions of joint uncertainty based on the “combined exper¬ 
iment” scenarios where either all the measurements are 
performed on independent, identically prepared quantum 
systems [as in Fig. (a)], or a random choice is made to 
decide which of the several measurements to perform [as 
in Fig. (b)]. All entropic relations based on joint un¬ 
certainty measures of the form /(p) + /(q), where / is 
an entropy function that is additive under tensor prod¬ 
ucts, fall under this category. Going beyond these oper¬ 
ational notions and using our general definition of joint 
uncertainty enables us to construct new UR’s, with the 
following “recipe”: 

1. Find a measure of the joint uncertainty (under a 
restricted class of symmetries, if applicable) of the 
desired number n of distributions, based on Def. 2; 

2. For the given n measurements, find a lower bound 
on the n-joint uncertainty of the outcome distri¬ 
butions of the measurements applied to quantum 
states, like the c in ® . This bound leads to an as¬ 
sertion of the form (IM, i.e. an uncertainty relation. 

As an illustration, we derive an uncertainty relation 
for two rank-1 projective measurements on pure states 
of a 2-level system, using the following joint uncertainty 
measure constructed using Def. 2: J 2 (p, q) = 1 — P'^ • q'^. 
Here (•) denotes the usual dot product. Note that this 
measure of joint uncertainty is faithful in the sense that it 
is zero if and only if both vectors p and q are completely 
certain. We find that, for p and q the outcome distribu¬ 
tions of the projective measurements with respect to two 
arbitrary orthonormal bases {[xi), [ 2 : 2 )} and I 2 / 2 )}, 

min J2(p,q) = ^(1 - (10) 

where ij := max^j- \ {xi\yj)\. We provide the proof in the 
Appendix]^ Since the measure J '2 cannot be interpreted 
based on the two combined-experiment scenarios under 
which most existing entropic UR’s fall, or indeed based 
on any single-experiment scenario, the above UR is non- 
trivially different from all previous ones. More generally, 
for d-dimensional p and q, any joint uncertainty mea¬ 
sure constructed as a Schur-concave function the vector 
(Pidi 5 7*292 ■ • • ^Pd^d) i® ^ valid joint uncertainty measure 
of (p, q). So is any Schur-concave function of vectors 
of dimension k < d constructed with the components 
(Pi + 9 i 5P2 + 92 ■ • ■ + 9t)' These are just a handful 

of examples that we contrived for illustration, suggesting 
that a rich variety of UR’s could be obtained by allowing 
joint uncertainty measures that don’t yield themselves 
to interpetation as the outcome uncertainty of any single 
experiment. 


IV. UNIVERSAL UNCERTAINTY RELATIONS 

We could construct various uncertainty relations using 
the aforementioned recipe, with the given pair [A, B) and 


different measures ff. Every relation is stated in terms of 
a lower bound like the c of (|^ , which in turn depends on 
ff. In general, for a given ff it might be hard to compute 
such a bound. But suppose there were a fixed pair (u, v) 
of distribution vectors, such that 

5/[p(p)5q(p)] > 57 (u,v) Vp,J. (11) 


If there were such a pair, then for any given j7o we would 
merely have to compute j7o(u, v), immediately yielding a 
bound. In this sense, finding such a pair would amount to 
finding a plethora of uncertainty relations; therefore, such 
a pair can be said to constitute a universal uncertainty 
relation for the pair {A,B) [211 [2S]. 


As it turns out, a nontrivial pair satisfying (11) never 


exists for any given (A, S), because the clause “VJ'” in 
© includes all single-uncertainty measures of p and q 
alone, leading necessarily to the trivial choice (uo,vo), 
where Uq p(/o) and vq >- q(/9) for all p. Such a (ug, Vg) 
would be unhelpful in that it wouldn’t impose joint re¬ 
strictions on (p,q). In order to avoid this triviality, we 
can relax the condition “VifT”, and instead require the 


inequality in (11) to only hold for some restricted class 
of J’s. 

Here we consider again the two restricted combined- 
experiment scenarios that we discussed in Section [IT] In 
the first scenario, both A and B are carried out indepen¬ 
dently of each other on copies of the same state p. The re¬ 
stricted class of joint uncertainty measures then consists 
of functions of the form J (p(p), q(p)) = U (p(p) ® q(p))- 
In Ref. [211 [2S], it was shown that for any given M, B there 
exists a distribution vector u = uj{A,B) such that the 
pair (u,e 2 ) forms a universal uncertainty relation under 
this restricted class of joint uncertainty measures. 

Similarly, following Example 2 of Section [Hi we can 
consider a combination wherein we first pick, at random, 
only one of the two measurements A and S, and then 
perform that one. The joint uncertainty measures con¬ 
sidered here are of the form U (p(p) 0 q(p))- A nontriv¬ 
ial (u, v) for this restricted class can be found using the 

methods in [2ai23- 

It might be possible to unify the spirit of the above two 
classes of universal relations into a larger class, by includ¬ 
ing all measures of joint uncertainty that are symmetric 
in the two (or more) distributions: >7(p,q) = >7(q, p)- 
This requirement avoids the case of trivial relations re¬ 
sulting from the requirement (u, v) > (p,q), but we 
leave it open whether a nontrivial (u, v) can be found. 
Another way of unifiying several classes of universal re¬ 
lations, each with its respective (ui,Vi), is by bounding 
any measure J as follows: 


57(p(p),q(p)) > min J"(Uj,Vj) Vp,J. (12) 


An interesting open problem is whether there exists a 
finite integer m such that minimizing over all j < m 
provides a nontrivial bound for all nontrivial joint uncer¬ 
tainty measures. 




Universal uncertainty relations are a powerful tool 
inasmuch as they generate a variety of uncertainty re¬ 
lations, but the bounds they yield may not be tight. Be¬ 
sides, there are joint uncertainty measures that may not 
lend themselves to inclusion in a class that admits a non¬ 
trivial universal relation, but nevertheless do provide a 
nontrivial uncertainty relation. An example is the mea¬ 
sure f7(p, q) = 1 — • q'^, for which we found a UR in 

the previous section. 

V. CONCLUSION 

In this paper, we identified the most basic, measure- 
independent elements of the concept of uncertainty as 
applicable to finite-dimensional classical variables. We 
based our analysis on an information-theoretic study of 
mechanisms of uncertainty increase: randomly-chosen 
symmetry transformations; and classical processing via 
channels (followed by recovery). Corresponding to these, 
we identihed two classes of doubly stochastic matrices, 
psym Uncertainty measures in the strictest 

sense must be monotonically non-decreasing under both 
these classes. 

We then took a similar information-theoretic approach 
to the concept of joint uncertainty of several variables, 
resulting in the principle that the most basic features 
of joint uncertainty measures must not depend on spe¬ 
cific operational combinations of the variables. We then 
considered quantum uncertainty relations (UR’s) of the 
preparational uncertainty type, where past works have 
always considered specihc operational combinations. Ap¬ 
plying our new notion of joint uncertainty not only re¬ 
sulted in a unihed understanding of a large class of UR’s, 
but also opened up the possibility of deriving a new class 
of preparational UR’s, namely identities that are math¬ 
ematically valid for any preparation, but cannot be in¬ 
terpreted based on any single experimental scenario. To 
illustrate, we constructed a class of joint uncertainty mea¬ 
sures with this property, and derived a new UR using one 
of these measures as an example. Finally, we found that 


so-called universal uncertainty relations cannot be found 
over all possible measures of joint uncertainty. We con¬ 
nected universal relations found in past works imz] 
with specific operational interpretations of joint uncer¬ 
tainty. 

In cryptographic tasks we must consider the uncer¬ 
tainty of systems that could be correlated with quan¬ 
tum memories in adversarial control; our recent work 
[50] is a step towards developing a measure-independent 
notion of such conditional uncertainty. More generally, 
a formalism for treating the uncertainty of quantum in¬ 
formation correlated with quantum memories is not yet 
developed. A more complete characterization of uncer¬ 
tainty on infinite-dimensional systems is another chal¬ 
lenging future project. This could impact applications 
of squeezed states, which are ubiquitous in quantum in¬ 
formation processing with continuous variables. Yet an¬ 
other open problem is to improve our understanding of 
universal uncertainty relations; in particular, to answer 
the open questions posed at the end of Section |IV| re¬ 
garding stronger classes of universal relations. Finally, 
there is much to be understood about the classes I?’’®'’ 
and of doubly stochastic matrices. 
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Appendix A: Example uncertainty relation for 
projective measurements on pure qubit states 

On a two-level quantum system (a qubit) in a pure 
state j'i/j), consider two rank-1 projective measurements 
A and B, respectively defined by the orthonormal bases 
1x2)} and {\yi), |j/ 2 )}- When A is applied on \ip), 
the resulting outcome distribution p('0) has the compo¬ 
nents pi{ip) =: _p{'0) = |(xi| V’)|^ and P 2 (V') = 1 - pW', 
similarly, we denote the distribution of the outcomes of B 
q_{tp) = (g(^/’), l — q{tpj). We shall now find a lower bound 
on the minimum joint uncertainty of (p('!/'), q(^)), over 
all pure states \ijj), under the measure 

J2(p,q) = 1-P'^ • (Al) 

Note that is a valid measure of joint uncertainty 
as it satisfies the constraints in Definition 2. We can 
partition the set of all pure states into two subsets iSi 
and ^2 given by 


phases can be added to the measurement basis vectors so 
that 


q{tp) = cos^ a; 

I '12 

p{tp) = I cos a cos /3 -I- sin a sin /3e*'^ | , 
where cos {/3) := \ {xi jyi) j. The minimization of the func¬ 


tion J 2 using its piecewise definition (A2) can be done 


by separately minimizing over and S 2 and then find¬ 
ing the smaller of the two minima. Let us first consider 
. Now, it can be verified that the p dependence of the 
function is through a term of the form /(a, /?) siT?{p/2), 
so that the minimization can be carried out first over p 
alone, and then over all a. In the cases where /(a, /?) > 0, 
the minimim over (j) is achieved when siv?{p>/2) = 1; if 
f{a,l3) < 0, the minimim occurs when s\v?{ip/2) = 0. 
In either case, the minimum over p, as a function of a, 
takes the form 


minj72(V') =: J{a) 

= cos^ a + cos^ (/I ± a) — 2 cos^ a cos^ (/? ± a). 

Since J is even in a, the subsequent minimization over a 
leads to the same value regardless of whether the positive 
or negative sign is used in the ± above. One can check 
that the minimum is attained at a = /3/2, yielding 

min J 2 = \ sin^ j3. (A4) 


{IV')I(P ^ 0-5, q ^ 0.5) or {p < 0.5, q < 0.5)}, Using similar arguments, we can determine the minimum 

S 2 = {\'4’)\{p ^ 0.5, q < 0.5) or {p < 0.5, q > 0.5)}, over the other partition: 


where p and q are understood to be ■i/'-dependent. The 
function J 2 can be defined piecewise using this partition 
as 



p + q- 2pq, 
l-p-q + 2pq, 


I'*/') € 

\^) G S2. 


(A2) 


min ^2 = I- cos^ B. (A5) 

H)eS2 2 

Note that without loss of generality we can take 0^/3^ 
tt/2. Comparing the two local minima, we can express 
the global minimum succinctly as 


Modulo a global phase, |^/>) can be parametrized as 

IV') = cosa|j/i)-be“^sinQ!|y2), (A3) 

with a S [0,7r/2] and p S [0,27r). Appropriate global 


min572(p,q) 


1 

2 




where r] := max^j \ {xi\yj)\. 


(A6) 



